Overview
Brought to you by YData
Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 421570 |
| Missing cells | 1422431 |
| Missing cells (%) | 21.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 86.0 MiB |
| Average record size in memory | 214.0 B |
Variable types
| Numeric | 13 |
|---|---|
| DateTime | 1 |
| Boolean | 1 |
| Categorical | 1 |
MarkDown1 is highly overall correlated with MarkDown4 and 1 other fields | High correlation |
MarkDown4 is highly overall correlated with MarkDown1 | High correlation |
MarkDown5 is highly overall correlated with MarkDown1 and 1 other fields | High correlation |
Size is highly overall correlated with MarkDown5 and 1 other fields | High correlation |
Store is highly overall correlated with Type | High correlation |
Type is highly overall correlated with Size and 1 other fields | High correlation |
IsHoliday is highly imbalanced (63.3%) | Imbalance |
MarkDown1 has 270889 (64.3%) missing values | Missing |
MarkDown2 has 310322 (73.6%) missing values | Missing |
MarkDown3 has 284479 (67.5%) missing values | Missing |
MarkDown4 has 286603 (68.0%) missing values | Missing |
MarkDown5 has 270138 (64.1%) missing values | Missing |
Reproduction
| Analysis started | 2025-08-23 19:07:00.043150 |
|---|---|
| Analysis finished | 2025-08-23 19:07:54.319918 |
| Duration | 54.28 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
Store
Real number (ℝ)
High correlation 
| Distinct | 45 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.200546 |
| Minimum | 1 |
|---|---|
| Maximum | 45 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 11 |
| median | 22 |
| Q3 | 33 |
| 95-th percentile | 43 |
| Maximum | 45 |
| Range | 44 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 12.785297 |
|---|---|
| Coefficient of variation (CV) | 0.57590014 |
| Kurtosis | -1.1465028 |
| Mean | 22.200546 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 0.077762502 |
| Sum | 9359084 |
| Variance | 163.46383 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 13 | 10474 | 2.5% |
| 10 | 10315 | 2.4% |
| 4 | 10272 | 2.4% |
| 1 | 10244 | 2.4% |
| 2 | 10238 | 2.4% |
| 24 | 10228 | 2.4% |
| 27 | 10225 | 2.4% |
| 34 | 10224 | 2.4% |
| 20 | 10214 | 2.4% |
| 6 | 10211 | 2.4% |
| Other values (35) | 318925 |
| Value | Count | Frequency (%) |
| 1 | 10244 | |
| 2 | 10238 | |
| 3 | 9036 | |
| 4 | 10272 | |
| 5 | 8999 | |
| 6 | 10211 | |
| 7 | 9762 | |
| 8 | 9895 | |
| 9 | 8867 | |
| 10 | 10315 |
| Value | Count | Frequency (%) |
| 45 | 9637 | |
| 44 | 7169 | |
| 43 | 6751 | |
| 42 | 6953 | |
| 41 | 10088 | |
| 40 | 10017 | |
| 39 | 9878 | |
| 38 | 7362 | |
| 37 | 7206 | |
| 36 | 6222 |
Dept
Real number (ℝ)
| Distinct | 81 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.260317 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 18 |
| median | 37 |
| Q3 | 74 |
| 95-th percentile | 95 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 56 |
Descriptive statistics
| Standard deviation | 30.492054 |
|---|---|
| Coefficient of variation (CV) | 0.68892534 |
| Kurtosis | -1.2155706 |
| Mean | 44.260317 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | 0.35822319 |
| Sum | 18658822 |
| Variance | 929.76536 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 6435 | 1.5% |
| 16 | 6435 | 1.5% |
| 92 | 6435 | 1.5% |
| 38 | 6435 | 1.5% |
| 40 | 6435 | 1.5% |
| 2 | 6435 | 1.5% |
| 82 | 6435 | 1.5% |
| 46 | 6435 | 1.5% |
| 95 | 6435 | 1.5% |
| 81 | 6435 | 1.5% |
| Other values (71) | 357220 |
| Value | Count | Frequency (%) |
| 1 | 6435 | |
| 2 | 6435 | |
| 3 | 6435 | |
| 4 | 6435 | |
| 5 | 6347 | |
| 6 | 5986 | |
| 7 | 6435 | |
| 8 | 6435 | |
| 9 | 6354 | |
| 10 | 6435 |
| Value | Count | Frequency (%) |
| 99 | 862 | 0.2% |
| 98 | 5836 | |
| 97 | 6278 | |
| 96 | 4854 | |
| 95 | 6435 | |
| 94 | 5685 | |
| 93 | 5913 | |
| 92 | 6435 | |
| 91 | 6435 | |
| 90 | 6435 |
Date
Date
| Distinct | 143 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 MiB |
| Minimum | 2010-02-05 00:00:00 |
|---|---|
| Maximum | 2012-10-26 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
IsHoliday
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 411.8 KiB |
| False | |
|---|---|
| True | 29661 |
| Value | Count | Frequency (%) |
| False | 391909 | |
| True | 29661 | 7.0% |
Temperature
Real number (ℝ)
| Distinct | 3528 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60.090059 |
| Minimum | -2.06 |
|---|---|
| Maximum | 100.14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 69 |
| Negative (%) | < 0.1% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | -2.06 |
|---|---|
| 5-th percentile | 27.31 |
| Q1 | 46.68 |
| median | 62.09 |
| Q3 | 74.28 |
| 95-th percentile | 87.27 |
| Maximum | 100.14 |
| Range | 102.2 |
| Interquartile range (IQR) | 27.6 |
Descriptive statistics
| Standard deviation | 18.447931 |
|---|---|
| Coefficient of variation (CV) | 0.30700471 |
| Kurtosis | -0.63592198 |
| Mean | 60.090059 |
| Median Absolute Deviation (MAD) | 13.63 |
| Skewness | -0.32140415 |
| Sum | 25332166 |
| Variance | 340.32616 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50.43 | 709 | 0.2% |
| 67.87 | 646 | 0.2% |
| 72.62 | 594 | 0.1% |
| 76.67 | 583 | 0.1% |
| 70.28 | 563 | 0.1% |
| 76.03 | 555 | 0.1% |
| 50.56 | 544 | 0.1% |
| 64.05 | 542 | 0.1% |
| 64.21 | 519 | 0.1% |
| 50.81 | 487 | 0.1% |
| Other values (3518) | 415828 |
| Value | Count | Frequency (%) |
| -2.06 | 69 | |
| 5.54 | 68 | |
| 6.23 | 69 | |
| 7.46 | 69 | |
| 9.51 | 70 | |
| 9.55 | 69 | |
| 10.09 | 66 | |
| 10.11 | 68 | |
| 10.24 | 69 | |
| 10.53 | 72 |
| Value | Count | Frequency (%) |
| 100.14 | 44 | < 0.1% |
| 100.07 | 46 | < 0.1% |
| 99.66 | 48 | < 0.1% |
| 99.22 | 185 | |
| 99.2 | 46 | < 0.1% |
| 98.43 | 43 | < 0.1% |
| 98.15 | 47 | < 0.1% |
| 97.66 | 42 | < 0.1% |
| 97.6 | 48 | < 0.1% |
| 97.18 | 187 |
Fuel_Price
Real number (ℝ)
| Distinct | 892 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3610265 |
| Minimum | 2.472 |
|---|---|
| Maximum | 4.468 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 2.472 |
|---|---|
| 5-th percentile | 2.653 |
| Q1 | 2.933 |
| median | 3.452 |
| Q3 | 3.738 |
| 95-th percentile | 4.029 |
| Maximum | 4.468 |
| Range | 1.996 |
| Interquartile range (IQR) | 0.805 |
Descriptive statistics
| Standard deviation | 0.45851454 |
|---|---|
| Coefficient of variation (CV) | 0.13642098 |
| Kurtosis | -1.1854045 |
| Mean | 3.3610265 |
| Median Absolute Deviation (MAD) | 0.375 |
| Skewness | -0.1049015 |
| Sum | 1416908 |
| Variance | 0.21023558 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.638 | 2548 | 0.6% |
| 3.63 | 2164 | 0.5% |
| 2.771 | 1917 | 0.5% |
| 3.891 | 1856 | 0.4% |
| 3.594 | 1796 | 0.4% |
| 3.524 | 1793 | 0.4% |
| 3.523 | 1792 | 0.4% |
| 2.72 | 1790 | 0.4% |
| 3.666 | 1778 | 0.4% |
| 2.78 | 1656 | 0.4% |
| Other values (882) | 402480 |
| Value | Count | Frequency (%) |
| 2.472 | 38 | < 0.1% |
| 2.513 | 45 | < 0.1% |
| 2.514 | 906 | |
| 2.52 | 39 | < 0.1% |
| 2.533 | 42 | < 0.1% |
| 2.539 | 37 | < 0.1% |
| 2.54 | 147 | < 0.1% |
| 2.542 | 45 | < 0.1% |
| 2.545 | 38 | < 0.1% |
| 2.548 | 902 |
| Value | Count | Frequency (%) |
| 4.468 | 368 | |
| 4.449 | 358 | |
| 4.308 | 168 | |
| 4.301 | 360 | |
| 4.294 | 363 | |
| 4.293 | 192 | |
| 4.288 | 172 | |
| 4.282 | 173 | |
| 4.277 | 357 | |
| 4.273 | 366 |
MarkDown1
Real number (ℝ)
High correlation  Missing 
| Distinct | 2277 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 270889 |
| Missing (%) | 64.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7246.4202 |
| Minimum | 0.27 |
|---|---|
| Maximum | 88646.76 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 0.27 |
|---|---|
| 5-th percentile | 149.19 |
| Q1 | 2240.27 |
| median | 5347.45 |
| Q3 | 9210.9 |
| 95-th percentile | 21801.35 |
| Maximum | 88646.76 |
| Range | 88646.49 |
| Interquartile range (IQR) | 6970.63 |
Descriptive statistics
| Standard deviation | 8291.2213 |
|---|---|
| Coefficient of variation (CV) | 1.1441817 |
| Kurtosis | 17.606263 |
| Mean | 7246.4202 |
| Median Absolute Deviation (MAD) | 3430.74 |
| Skewness | 3.3418447 |
| Sum | 1.0918978 × 109 |
| Variance | 68744351 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.5 | 102 | < 0.1% |
| 460.73 | 102 | < 0.1% |
| 175.64 | 93 | < 0.1% |
| 1282.42 | 75 | < 0.1% |
| 9264.48 | 75 | < 0.1% |
| 686.24 | 75 | < 0.1% |
| 5924.71 | 75 | < 0.1% |
| 1483.17 | 75 | < 0.1% |
| 3124.45 | 74 | < 0.1% |
| 6809.96 | 74 | < 0.1% |
| Other values (2267) | 149861 | |
| (Missing) | 270889 |
| Value | Count | Frequency (%) |
| 0.27 | 51 | |
| 0.5 | 49 | |
| 1.5 | 102 | |
| 1.94 | 50 | |
| 2.12 | 52 | |
| 2.4 | 49 | |
| 2.42 | 50 | |
| 2.43 | 51 | |
| 2.8 | 50 | |
| 2.91 | 51 |
| Value | Count | Frequency (%) |
| 88646.76 | 68 | |
| 78124.5 | 70 | |
| 75149.79 | 73 | |
| 65021.23 | 73 | |
| 62567.6 | 66 | |
| 62172.73 | 72 | |
| 60740.64 | 70 | |
| 60394.73 | 72 | |
| 58928.52 | 72 | |
| 56917.7 | 71 |
MarkDown2
Real number (ℝ)
Missing 
| Distinct | 1499 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 310322 |
| Missing (%) | 73.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3334.6286 |
| Minimum | -265.76 |
|---|---|
| Maximum | 104519.54 |
| Zeros | 207 |
| Zeros (%) | < 0.1% |
| Negative | 1311 |
| Negative (%) | 0.3% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | -265.76 |
|---|---|
| 5-th percentile | 1.95 |
| Q1 | 41.6 |
| median | 192 |
| Q3 | 1926.94 |
| 95-th percentile | 16497.47 |
| Maximum | 104519.54 |
| Range | 104785.3 |
| Interquartile range (IQR) | 1885.34 |
Descriptive statistics
| Standard deviation | 9475.3573 |
|---|---|
| Coefficient of variation (CV) | 2.841503 |
| Kurtosis | 37.589561 |
| Mean | 3334.6286 |
| Median Absolute Deviation (MAD) | 184.73 |
| Skewness | 5.4412612 |
| Sum | 3.7097076 × 108 |
| Variance | 89782396 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.91 | 539 | 0.1% |
| 3 | 493 | 0.1% |
| 0.5 | 485 | 0.1% |
| 1.5 | 471 | 0.1% |
| 4 | 367 | 0.1% |
| 6 | 365 | 0.1% |
| 7.64 | 354 | 0.1% |
| 3.82 | 353 | 0.1% |
| 19 | 345 | 0.1% |
| 5.73 | 345 | 0.1% |
| Other values (1489) | 107131 | 25.4% |
| (Missing) | 310322 |
| Value | Count | Frequency (%) |
| -265.76 | 71 | |
| -192 | 72 | |
| -20 | 72 | |
| -10.98 | 60 | |
| -10.5 | 143 | |
| -9.98 | 68 | |
| -9.94 | 62 | |
| -7.6 | 69 | |
| -7.01 | 69 | |
| -6.69 | 69 |
| Value | Count | Frequency (%) |
| 104519.54 | 72 | |
| 97740.99 | 73 | |
| 92523.94 | 73 | |
| 89121.94 | 74 | |
| 82881.16 | 73 | |
| 72413.71 | 72 | |
| 70574.85 | 71 | |
| 58804.91 | 69 | |
| 58046.41 | 71 | |
| 56106.2 | 72 |
MarkDown3
Real number (ℝ)
Missing 
| Distinct | 1662 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 284479 |
| Missing (%) | 67.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1439.4214 |
| Minimum | -29.1 |
|---|---|
| Maximum | 141630.61 |
| Zeros | 67 |
| Zeros (%) | < 0.1% |
| Negative | 257 |
| Negative (%) | 0.1% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | -29.1 |
|---|---|
| 5-th percentile | 0.65 |
| Q1 | 5.08 |
| median | 24.6 |
| Q3 | 103.99 |
| 95-th percentile | 1059.9 |
| Maximum | 141630.61 |
| Range | 141659.71 |
| Interquartile range (IQR) | 98.91 |
Descriptive statistics
| Standard deviation | 9623.0783 |
|---|---|
| Coefficient of variation (CV) | 6.6853796 |
| Kurtosis | 77.687772 |
| Mean | 1439.4214 |
| Median Absolute Deviation (MAD) | 22.6 |
| Skewness | 8.399453 |
| Sum | 1.9733172 × 108 |
| Variance | 92603636 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 754 | 0.2% |
| 6 | 710 | 0.2% |
| 2 | 660 | 0.2% |
| 1 | 611 | 0.1% |
| 0.22 | 487 | 0.1% |
| 0.5 | 463 | 0.1% |
| 0.01 | 444 | 0.1% |
| 4 | 439 | 0.1% |
| 3.2 | 379 | 0.1% |
| 1.98 | 363 | 0.1% |
| Other values (1652) | 131781 | |
| (Missing) | 284479 |
| Value | Count | Frequency (%) |
| -29.1 | 72 | < 0.1% |
| -1 | 70 | < 0.1% |
| -0.87 | 46 | < 0.1% |
| -0.2 | 69 | < 0.1% |
| 0 | 67 | < 0.1% |
| 0.01 | 444 | |
| 0.02 | 124 | < 0.1% |
| 0.04 | 241 | |
| 0.05 | 71 | < 0.1% |
| 0.06 | 205 |
| Value | Count | Frequency (%) |
| 141630.61 | 74 | |
| 109030.75 | 75 | |
| 103991.94 | 72 | |
| 101378.79 | 73 | |
| 89402.64 | 71 | |
| 88805.58 | 72 | |
| 83340.33 | 74 | |
| 83192.81 | 74 | |
| 79621.2 | 72 | |
| 77451.26 | 73 |
MarkDown4
Real number (ℝ)
High correlation  Missing 
| Distinct | 1944 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 286603 |
| Missing (%) | 68.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3383.1683 |
| Minimum | 0.22 |
|---|---|
| Maximum | 67474.85 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 0.22 |
|---|---|
| 5-th percentile | 28.76 |
| Q1 | 504.22 |
| median | 1481.31 |
| Q3 | 3595.04 |
| 95-th percentile | 12645.96 |
| Maximum | 67474.85 |
| Range | 67474.63 |
| Interquartile range (IQR) | 3090.82 |
Descriptive statistics
| Standard deviation | 6292.384 |
|---|---|
| Coefficient of variation (CV) | 1.8599087 |
| Kurtosis | 29.996815 |
| Mean | 3383.1683 |
| Median Absolute Deviation (MAD) | 1167.55 |
| Skewness | 4.8475 |
| Sum | 4.5661607 × 108 |
| Variance | 39594097 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 280 | 0.1% |
| 4 | 200 | < 0.1% |
| 2 | 197 | < 0.1% |
| 3 | 146 | < 0.1% |
| 47 | 143 | < 0.1% |
| 67.72 | 142 | < 0.1% |
| 657.56 | 141 | < 0.1% |
| 17 | 141 | < 0.1% |
| 8 | 140 | < 0.1% |
| 1330.36 | 140 | < 0.1% |
| Other values (1934) | 133297 | |
| (Missing) | 286603 |
| Value | Count | Frequency (%) |
| 0.22 | 57 | < 0.1% |
| 0.41 | 52 | < 0.1% |
| 0.46 | 48 | < 0.1% |
| 0.78 | 52 | < 0.1% |
| 0.87 | 49 | < 0.1% |
| 0.92 | 45 | < 0.1% |
| 1.5 | 55 | < 0.1% |
| 1.88 | 48 | < 0.1% |
| 1.98 | 44 | < 0.1% |
| 2 | 197 |
| Value | Count | Frequency (%) |
| 67474.85 | 72 | |
| 57817.56 | 74 | |
| 57815.43 | 68 | |
| 53603.99 | 72 | |
| 52739.02 | 72 | |
| 48403.53 | 70 | |
| 48159.86 | 73 | |
| 48086.64 | 72 | |
| 47452.43 | 73 | |
| 46238.28 | 71 |
MarkDown5
Real number (ℝ)
High correlation  Missing 
| Distinct | 2293 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 270138 |
| Missing (%) | 64.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4628.9751 |
| Minimum | 135.16 |
|---|---|
| Maximum | 108519.28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 135.16 |
|---|---|
| 5-th percentile | 715.52 |
| Q1 | 1878.44 |
| median | 3359.45 |
| Q3 | 5563.8 |
| 95-th percentile | 11269.24 |
| Maximum | 108519.28 |
| Range | 108384.12 |
| Interquartile range (IQR) | 3685.36 |
Descriptive statistics
| Standard deviation | 5962.8875 |
|---|---|
| Coefficient of variation (CV) | 1.2881658 |
| Kurtosis | 107.84927 |
| Mean | 4628.9751 |
| Median Absolute Deviation (MAD) | 1702.47 |
| Skewness | 8.1699095 |
| Sum | 7.0097495 × 108 |
| Variance | 35556027 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2743.18 | 136 | < 0.1% |
| 1064.56 | 120 | < 0.1% |
| 9083.54 | 75 | < 0.1% |
| 3567.03 | 75 | < 0.1% |
| 3557.67 | 75 | < 0.1% |
| 20371.02 | 75 | < 0.1% |
| 4180.29 | 75 | < 0.1% |
| 1773.53 | 74 | < 0.1% |
| 3932.94 | 74 | < 0.1% |
| 4464.45 | 74 | < 0.1% |
| Other values (2283) | 150579 | |
| (Missing) | 270138 |
| Value | Count | Frequency (%) |
| 135.16 | 65 | |
| 153.04 | 47 | |
| 153.9 | 49 | |
| 164.08 | 52 | |
| 170.64 | 69 | |
| 171.76 | 71 | |
| 180.07 | 64 | |
| 212.75 | 50 | |
| 224.86 | 50 | |
| 227.12 | 48 |
| Value | Count | Frequency (%) |
| 108519.28 | 68 | |
| 105223.11 | 70 | |
| 85851.87 | 68 | |
| 63005.58 | 69 | |
| 58068.14 | 69 | |
| 57029.78 | 68 | |
| 53212.72 | 70 | |
| 37581.27 | 70 | |
| 36430.33 | 71 | |
| 36360.42 | 72 |
CPI
Real number (ℝ)
| Distinct | 2145 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 171.20195 |
| Minimum | 126.064 |
|---|---|
| Maximum | 227.23281 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 126.064 |
|---|---|
| 5-th percentile | 126.49626 |
| Q1 | 132.02267 |
| median | 182.31878 |
| Q3 | 212.41699 |
| 95-th percentile | 221.94156 |
| Maximum | 227.23281 |
| Range | 101.16881 |
| Interquartile range (IQR) | 80.394326 |
Descriptive statistics
| Standard deviation | 39.159276 |
|---|---|
| Coefficient of variation (CV) | 0.22873149 |
| Kurtosis | -1.8297144 |
| Mean | 171.20195 |
| Median Absolute Deviation (MAD) | 41.434863 |
| Skewness | 0.085219285 |
| Sum | 72173605 |
| Variance | 1533.4489 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 129.8555333 | 711 | 0.2% |
| 131.1083333 | 708 | 0.2% |
| 129.8459667 | 707 | 0.2% |
| 130.3849032 | 706 | 0.2% |
| 130.6457931 | 706 | 0.2% |
| 131.0756667 | 706 | 0.2% |
| 130.683 | 706 | 0.2% |
| 130.4546207 | 705 | 0.2% |
| 130.7196333 | 705 | 0.2% |
| 130.737871 | 704 | 0.2% |
| Other values (2135) | 414506 |
| Value | Count | Frequency (%) |
| 126.064 | 678 | |
| 126.0766452 | 679 | |
| 126.0854516 | 675 | |
| 126.0892903 | 682 | |
| 126.1019355 | 686 | |
| 126.1069032 | 681 | |
| 126.1119032 | 682 | |
| 126.114 | 687 | |
| 126.1145806 | 689 | |
| 126.1266 | 683 |
| Value | Count | Frequency (%) |
| 227.2328068 | 63 | |
| 227.214288 | 62 | |
| 227.1693919 | 63 | |
| 227.0369359 | 70 | |
| 227.0184166 | 69 | |
| 226.9873637 | 134 | |
| 226.9735448 | 69 | |
| 226.9688442 | 134 | |
| 226.9662325 | 63 | |
| 226.9239785 | 135 |
Unemployment
Real number (ℝ)
| Distinct | 349 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.9602887 |
| Minimum | 3.879 |
|---|---|
| Maximum | 14.313 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 3.879 |
|---|---|
| 5-th percentile | 5.326 |
| Q1 | 6.891 |
| median | 7.866 |
| Q3 | 8.572 |
| 95-th percentile | 12.187 |
| Maximum | 14.313 |
| Range | 10.434 |
| Interquartile range (IQR) | 1.681 |
Descriptive statistics
| Standard deviation | 1.863296 |
|---|---|
| Coefficient of variation (CV) | 0.23407393 |
| Kurtosis | 2.7312166 |
| Mean | 7.9602887 |
| Median Absolute Deviation (MAD) | 0.858 |
| Skewness | 1.1837426 |
| Sum | 3355818.9 |
| Variance | 3.4718721 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.099 | 5152 | 1.2% |
| 8.163 | 3636 | 0.9% |
| 7.852 | 3614 | 0.9% |
| 7.343 | 3416 | 0.8% |
| 7.057 | 3414 | 0.8% |
| 7.931 | 3400 | 0.8% |
| 7.441 | 3397 | 0.8% |
| 6.565 | 3370 | 0.8% |
| 8.2 | 3361 | 0.8% |
| 6.891 | 3360 | 0.8% |
| Other values (339) | 385450 |
| Value | Count | Frequency (%) |
| 3.879 | 287 | 0.1% |
| 4.077 | 938 | |
| 4.125 | 1831 | |
| 4.145 | 562 | 0.1% |
| 4.156 | 1815 | |
| 4.261 | 1829 | |
| 4.308 | 935 | |
| 4.42 | 1855 | |
| 4.584 | 1988 | |
| 4.607 | 935 |
| Value | Count | Frequency (%) |
| 14.313 | 2636 | |
| 14.18 | 2423 | |
| 14.099 | 2441 | |
| 14.021 | 2263 | |
| 13.975 | 1529 | |
| 13.736 | 2464 | |
| 13.503 | 2661 | |
| 12.89 | 2491 | |
| 12.187 | 2507 | |
| 11.627 | 2502 |
Type
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.1 MiB |
| A | |
|---|---|
| B | |
| C |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 215478 | |
| B | 163495 | |
| C | 42597 | 10.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 215478 | |
| b | 163495 | |
| c | 42597 | 10.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 215478 | |
| B | 163495 | |
| C | 42597 | 10.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 421570 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 215478 | |
| B | 163495 | |
| C | 42597 | 10.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 421570 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 215478 | |
| B | 163495 | |
| C | 42597 | 10.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 421570 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 215478 | |
| B | 163495 | |
| C | 42597 | 10.1% |
Size
Real number (ℝ)
High correlation 
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 136727.92 |
| Minimum | 34875 |
|---|---|
| Maximum | 219622 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 34875 |
|---|---|
| 5-th percentile | 39690 |
| Q1 | 93638 |
| median | 140167 |
| Q3 | 202505 |
| 95-th percentile | 206302 |
| Maximum | 219622 |
| Range | 184747 |
| Interquartile range (IQR) | 108867 |
Descriptive statistics
| Standard deviation | 60980.583 |
|---|---|
| Coefficient of variation (CV) | 0.44599951 |
| Kurtosis | -1.2063459 |
| Mean | 136727.92 |
| Median Absolute Deviation (MAD) | 62140 |
| Skewness | -0.32584977 |
| Sum | 5.7640387 × 1010 |
| Variance | 3.7186315 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 39690 | 20802 | 4.9% |
| 39910 | 20597 | 4.9% |
| 203819 | 20376 | 4.8% |
| 219622 | 10474 | 2.5% |
| 126512 | 10315 | 2.4% |
| 205863 | 10272 | 2.4% |
| 151315 | 10244 | 2.4% |
| 202307 | 10238 | 2.4% |
| 204184 | 10225 | 2.4% |
| 158114 | 10224 | 2.4% |
| Other values (30) | 287803 |
| Value | Count | Frequency (%) |
| 34875 | 8999 | |
| 37392 | 9036 | |
| 39690 | 20802 | |
| 39910 | 20597 | |
| 41062 | 6751 | 1.6% |
| 42988 | 7156 | 1.7% |
| 57197 | 9443 | |
| 70713 | 9762 | |
| 93188 | 9864 | |
| 93638 | 9455 |
| Value | Count | Frequency (%) |
| 219622 | 10474 | |
| 207499 | 10062 | |
| 206302 | 10113 | |
| 205863 | 10272 | |
| 204184 | 10225 | |
| 203819 | 20376 | |
| 203750 | 10142 | |
| 203742 | 10214 | |
| 203007 | 10202 | |
| 202505 | 10211 |
Weekly_Sales
Real number (ℝ)
| Distinct | 359464 |
|---|---|
| Distinct (%) | 85.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15981.258 |
| Minimum | -4988.94 |
|---|---|
| Maximum | 693099.36 |
| Zeros | 73 |
| Zeros (%) | < 0.1% |
| Negative | 1285 |
| Negative (%) | 0.3% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | -4988.94 |
|---|---|
| 5-th percentile | 59.9745 |
| Q1 | 2079.65 |
| median | 7612.03 |
| Q3 | 20205.853 |
| 95-th percentile | 61201.951 |
| Maximum | 693099.36 |
| Range | 698088.3 |
| Interquartile range (IQR) | 18126.202 |
Descriptive statistics
| Standard deviation | 22711.184 |
|---|---|
| Coefficient of variation (CV) | 1.4211136 |
| Kurtosis | 21.49129 |
| Mean | 15981.258 |
| Median Absolute Deviation (MAD) | 6747.645 |
| Skewness | 3.2620082 |
| Sum | 6.737219 × 109 |
| Variance | 5.1579786 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 353 | 0.1% |
| 5 | 289 | 0.1% |
| 20 | 232 | 0.1% |
| 15 | 215 | 0.1% |
| 12 | 175 | < 0.1% |
| 1 | 169 | < 0.1% |
| 10.47 | 167 | < 0.1% |
| 11.97 | 154 | < 0.1% |
| 2 | 148 | < 0.1% |
| 7 | 146 | < 0.1% |
| Other values (359454) | 419522 |
| Value | Count | Frequency (%) |
| -4988.94 | 1 | < 0.1% |
| -3924 | 1 | < 0.1% |
| -1750 | 1 | < 0.1% |
| -1699 | 1 | < 0.1% |
| -1321.48 | 1 | < 0.1% |
| -1098 | 3 | |
| -1008.96 | 1 | < 0.1% |
| -898 | 1 | < 0.1% |
| -863 | 1 | < 0.1% |
| -798 | 4 |
| Value | Count | Frequency (%) |
| 693099.36 | 1 | |
| 649770.18 | 1 | |
| 630999.19 | 1 | |
| 627962.93 | 1 | |
| 474330.1 | 1 | |
| 422306.25 | 1 | |
| 420586.57 | 1 | |
| 406988.63 | 1 | |
| 404245.03 | 1 | |
| 393705.2 | 1 |
Interactions
Correlations
| CPI | Dept | Fuel_Price | IsHoliday | MarkDown1 | MarkDown2 | MarkDown3 | MarkDown4 | MarkDown5 | Size | Store | Temperature | Type | Unemployment | Weekly_Sales | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CPI | 1.000 | -0.009 | -0.041 | 0.012 | -0.017 | -0.099 | -0.111 | -0.063 | 0.021 | -0.005 | -0.230 | 0.173 | 0.183 | -0.383 | -0.023 |
| Dept | -0.009 | 1.000 | 0.003 | 0.000 | 0.002 | 0.003 | 0.006 | 0.007 | 0.006 | 0.011 | 0.014 | 0.001 | 0.080 | 0.006 | -0.014 |
| Fuel_Price | -0.041 | 0.003 | 1.000 | 0.136 | 0.163 | -0.155 | -0.218 | 0.073 | -0.088 | 0.004 | 0.074 | 0.128 | 0.088 | -0.060 | 0.002 |
| IsHoliday | 0.012 | 0.000 | 0.136 | 1.000 | 0.057 | 0.359 | 0.458 | 0.115 | 0.060 | 0.000 | 0.000 | 0.186 | 0.000 | 0.035 | 0.031 |
| MarkDown1 | -0.017 | 0.002 | 0.163 | 0.057 | 1.000 | 0.206 | 0.154 | 0.759 | 0.508 | 0.499 | -0.212 | 0.002 | 0.172 | 0.064 | 0.192 |
| MarkDown2 | -0.099 | 0.003 | -0.155 | 0.359 | 0.206 | 1.000 | 0.066 | 0.116 | 0.152 | 0.149 | 0.009 | -0.462 | 0.066 | 0.060 | 0.032 |
| MarkDown3 | -0.111 | 0.006 | -0.218 | 0.458 | 0.154 | 0.066 | 1.000 | 0.002 | 0.244 | 0.300 | -0.065 | -0.257 | 0.065 | 0.043 | 0.135 |
| MarkDown4 | -0.063 | 0.007 | 0.073 | 0.115 | 0.759 | 0.116 | 0.002 | 1.000 | 0.380 | 0.288 | -0.039 | 0.141 | 0.066 | 0.038 | 0.112 |
| MarkDown5 | 0.021 | 0.006 | -0.088 | 0.060 | 0.508 | 0.152 | 0.244 | 0.380 | 1.000 | 0.579 | -0.156 | -0.071 | 0.094 | -0.019 | 0.208 |
| Size | -0.005 | 0.011 | 0.004 | 0.000 | 0.499 | 0.149 | 0.300 | 0.288 | 0.579 | 1.000 | -0.160 | -0.043 | 0.851 | -0.066 | 0.290 |
| Store | -0.230 | 0.014 | 0.074 | 0.000 | -0.212 | 0.009 | -0.065 | -0.039 | -0.156 | -0.160 | 1.000 | -0.057 | 0.538 | 0.295 | -0.102 |
| Temperature | 0.173 | 0.001 | 0.128 | 0.186 | 0.002 | -0.462 | -0.257 | 0.141 | -0.071 | -0.043 | -0.057 | 1.000 | 0.123 | 0.030 | -0.020 |
| Type | 0.183 | 0.080 | 0.088 | 0.000 | 0.172 | 0.066 | 0.065 | 0.066 | 0.094 | 0.851 | 0.538 | 0.123 | 1.000 | 0.181 | 0.089 |
| Unemployment | -0.383 | 0.006 | -0.060 | 0.035 | 0.064 | 0.060 | 0.043 | 0.038 | -0.019 | -0.066 | 0.295 | 0.030 | 0.181 | 1.000 | -0.016 |
| Weekly_Sales | -0.023 | -0.014 | 0.002 | 0.031 | 0.192 | 0.032 | 0.135 | 0.112 | 0.208 | 0.290 | -0.102 | -0.020 | 0.089 | -0.016 | 1.000 |
Missing values
Sample
| Store | Dept | Date | IsHoliday | Temperature | Fuel_Price | MarkDown1 | MarkDown2 | MarkDown3 | MarkDown4 | MarkDown5 | CPI | Unemployment | Type | Size | Weekly_Sales | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 1 | 2010-02-05 | False | 42.31 | 2.572 | NaN | NaN | NaN | NaN | NaN | 211.096358 | 8.106 | A | 151315 | 24924.50 |
| 1 | 1 | 1 | 2010-02-12 | True | 38.51 | 2.548 | NaN | NaN | NaN | NaN | NaN | 211.242170 | 8.106 | A | 151315 | 46039.49 |
| 2 | 1 | 1 | 2010-02-19 | False | 39.93 | 2.514 | NaN | NaN | NaN | NaN | NaN | 211.289143 | 8.106 | A | 151315 | 41595.55 |
| 3 | 1 | 1 | 2010-02-26 | False | 46.63 | 2.561 | NaN | NaN | NaN | NaN | NaN | 211.319643 | 8.106 | A | 151315 | 19403.54 |
| 4 | 1 | 1 | 2010-03-05 | False | 46.50 | 2.625 | NaN | NaN | NaN | NaN | NaN | 211.350143 | 8.106 | A | 151315 | 21827.90 |
| 5 | 1 | 1 | 2010-03-12 | False | 57.79 | 2.667 | NaN | NaN | NaN | NaN | NaN | 211.380643 | 8.106 | A | 151315 | 21043.39 |
| 6 | 1 | 1 | 2010-03-19 | False | 54.58 | 2.720 | NaN | NaN | NaN | NaN | NaN | 211.215635 | 8.106 | A | 151315 | 22136.64 |
| 7 | 1 | 1 | 2010-03-26 | False | 51.45 | 2.732 | NaN | NaN | NaN | NaN | NaN | 211.018042 | 8.106 | A | 151315 | 26229.21 |
| 8 | 1 | 1 | 2010-04-02 | False | 62.27 | 2.719 | NaN | NaN | NaN | NaN | NaN | 210.820450 | 7.808 | A | 151315 | 57258.43 |
| 9 | 1 | 1 | 2010-04-09 | False | 65.86 | 2.770 | NaN | NaN | NaN | NaN | NaN | 210.622857 | 7.808 | A | 151315 | 42960.91 |
| Store | Dept | Date | IsHoliday | Temperature | Fuel_Price | MarkDown1 | MarkDown2 | MarkDown3 | MarkDown4 | MarkDown5 | CPI | Unemployment | Type | Size | Weekly_Sales | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 421560 | 45 | 98 | 2012-08-24 | False | 72.62 | 3.834 | 7936.20 | 58.38 | 22.00 | 5518.07 | 2291.97 | 191.344887 | 8.684 | B | 118221 | 415.40 |
| 421561 | 45 | 98 | 2012-08-31 | False | 75.09 | 3.867 | 23641.30 | 6.00 | 92.93 | 6988.31 | 3992.13 | 191.461281 | 8.684 | B | 118221 | 346.04 |
| 421562 | 45 | 98 | 2012-09-07 | True | 75.70 | 3.911 | 11024.45 | 12.80 | 52.63 | 1854.77 | 2055.70 | 191.577676 | 8.684 | B | 118221 | 352.44 |
| 421563 | 45 | 98 | 2012-09-14 | False | 67.87 | 3.948 | 11407.95 | NaN | 4.30 | 3421.72 | 5268.92 | 191.699850 | 8.684 | B | 118221 | 605.96 |
| 421564 | 45 | 98 | 2012-09-21 | False | 65.32 | 4.038 | 8452.20 | 92.28 | 63.24 | 2376.38 | 8670.40 | 191.856704 | 8.684 | B | 118221 | 467.30 |
| 421565 | 45 | 98 | 2012-09-28 | False | 64.88 | 3.997 | 4556.61 | 20.64 | 1.50 | 1601.01 | 3288.25 | 192.013558 | 8.684 | B | 118221 | 508.37 |
| 421566 | 45 | 98 | 2012-10-05 | False | 64.89 | 3.985 | 5046.74 | NaN | 18.82 | 2253.43 | 2340.01 | 192.170412 | 8.667 | B | 118221 | 628.10 |
| 421567 | 45 | 98 | 2012-10-12 | False | 54.47 | 4.000 | 1956.28 | NaN | 7.89 | 599.32 | 3990.54 | 192.327265 | 8.667 | B | 118221 | 1061.02 |
| 421568 | 45 | 98 | 2012-10-19 | False | 56.47 | 3.969 | 2004.02 | NaN | 3.18 | 437.73 | 1537.49 | 192.330854 | 8.667 | B | 118221 | 760.01 |
| 421569 | 45 | 98 | 2012-10-26 | False | 58.85 | 3.882 | 4018.91 | 58.08 | 100.00 | 211.94 | 858.33 | 192.308899 | 8.667 | B | 118221 | 1076.80 |